Update indexing strategy recommendations #2408

shubhaat · 2025-08-05T00:02:54Z

Adds indexing strategy recommendations to address the large overhead caused by many small indices.

Indexing strategy recommendations

github-actions · 2025-08-05T00:04:45Z

🔍 Preview links for changed docs

deploy-manage/cloud-organization/billing/elasticsearch-billing-dimensions.md

kilfoyle · 2025-08-05T14:12:57Z

deploy-manage/cloud-organization/billing/elasticsearch-billing-dimensions.md

+    * To ensure optimal performance and cost-effectiveness for your project, it’s important to consider how you structure your data.
+        * Consolidate small indices for better efficiency. We recommend avoiding a design where your project contains hundreds of very small indices, specifically those under 1GB each.
+    * Why is this important?
+         * Every index in Elasticsearch has a certain amount of resource overhead. This is because Elasticsearch needs to maintain metadata for each index to keep it running smoothly. When you have a very large number of small indices, the combined               overhead from all of them can consume more CPU resources than if the same data were stored in fewer, larger indices. This can lead to higher resource consumption and hence higher costs and potentially impact the overall performance of your project.
+
+    * Recommended Approach
+        * If your use case naturally generates many small, separate streams of data, we advise implementing a process to consolidate them into fewer, larger indices. This practice leads to more efficient resource utilization. By grouping your data               into larger indices, you can ensure a more performant and cost-efficient experience with Elasticsearch Serverless.


Suggested change

* To ensure optimal performance and cost-effectiveness for your project, it’s important to consider how you structure your data.

* Consolidate small indices for better efficiency. We recommend avoiding a design where your project contains hundreds of very small indices, specifically those under 1GB each.

* Why is this important?

* Every index in Elasticsearch has a certain amount of resource overhead. This is because Elasticsearch needs to maintain metadata for each index to keep it running smoothly. When you have a very large number of small indices, the combined overhead from all of them can consume more CPU resources than if the same data were stored in fewer, larger indices. This can lead to higher resource consumption and hence higher costs and potentially impact the overall performance of your project.

* Recommended Approach

* If your use case naturally generates many small, separate streams of data, we advise implementing a process to consolidate them into fewer, larger indices. This practice leads to more efficient resource utilization. By grouping your data into larger indices, you can ensure a more performant and cost-efficient experience with Elasticsearch Serverless.

* To ensure optimal performance and cost-effectiveness for your project, it’s important to consider how you structure your data.

* Consolidate small indices for better efficiency. We recommend avoiding a design where your project contains hundreds of very small indices, specifically those under 1GB each.

* Why is this important?

* Every index in {{es}} has a certain amount of resource overhead. This is because {{es}} maintains metadata for each index to keep it running smoothly. When you have a very large number of small indices, the combined overhead from all of them can consume more CPU resources than if the same data were stored in fewer, larger indices. This can lead to higher resource consumption and hence higher costs, and can also impact the overall performance of your project.

* Recommended Approach

* If your use case naturally generates many small, separate streams of data, we advise implementing a process to consolidate them into fewer, larger indices. This practice leads to more efficient resource utilization. By grouping your data into larger indices, you can ensure a more performant and cost-efficient experience with {{es-serverless}}.
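As a rough illustration of the consolidation advice above, here is a minimal Python sketch that flags indices under the 1GB threshold and groups them by a shared name prefix. The function name `plan_consolidation`, the `<prefix>-<suffix>` naming assumption, the `-consolidated` target suffix, and the sample sizes are all hypothetical and not part of the docs change; index sizes are assumed to come from your own monitoring.

```python
from collections import defaultdict

ONE_GB = 1024 ** 3  # threshold from the recommendation ("under 1GB each")


def plan_consolidation(index_sizes, threshold_bytes=ONE_GB):
    """Group small indices by a shared name prefix.

    index_sizes: dict mapping index name -> size in bytes (hypothetical input).
    Returns a dict mapping a proposed target name -> list of small source
    indices that could be merged into it.
    """
    plan = defaultdict(list)
    for name, size in sorted(index_sizes.items()):
        if size < threshold_bytes:
            # Assumption: names look like "<prefix>-<suffix>"; the prefix
            # becomes the name of the consolidated target.
            prefix = name.rsplit("-", 1)[0]
            plan[f"{prefix}-consolidated"].append(name)
    # Only targets that merge two or more indices reduce the index count.
    return {t: srcs for t, srcs in plan.items() if len(srcs) > 1}


# Made-up sample data for illustration only.
sizes = {
    "logs-tenant-a": 200 * 1024 ** 2,
    "logs-tenant-b": 350 * 1024 ** 2,
    "metrics-2025": 5 * ONE_GB,
    "audit-eu": 10 * 1024 ** 2,
}
print(plan_consolidation(sizes))
```

The sketch only produces a plan; how the data is then moved into the larger index is left out, since that depends on your ingestion setup.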

kilfoyle · 2025-08-05T14:15:56Z

Looks good @shubhaat! 🚀
I added some small suggestions. Here's how it'll appear:

shubhaat · 2025-08-05T20:51:29Z

@kilfoyle Thanks, I saw the formatting weirdness. Would you be kind enough to approve so we can merge and get this published?

kilfoyle · 2025-08-05T21:11:27Z

deploy-manage/cloud-organization/billing/elasticsearch-billing-dimensions.md

+         * Every index in Elasticsearch has a certain amount of resource overhead. This is because Elasticsearch needs to maintain metadata for each index to keep it running smoothly. When you have a very large number of small indices, the combined               overhead from all of them can consume more CPU resources than if the same data were stored in fewer, larger indices. This can lead to higher resource consumption and hence higher costs and potentially impact the overall performance of your project.
+
+    * Recommended Approach
+        * If your use case naturally generates many small, separate streams of data, we advise implementing a process to consolidate them into fewer, larger indices. This practice leads to more efficient resource utilization. By grouping your data               into larger indices, you can ensure a more performant and cost-efficient experience with Elasticsearch Serverless.


Suggested change

* If your use case naturally generates many small, separate streams of data, we advise implementing a process to consolidate them into fewer, larger indices. This practice leads to more efficient resource utilization. By grouping your data into larger indices, you can ensure a more performant and cost-efficient experience with Elasticsearch Serverless.

* If your use case naturally generates many small, separate streams of data, we advise implementing a process to consolidate them into fewer, larger indices. This practice leads to more efficient resource utilization. By grouping your data into larger indices, you can ensure a more performant and cost-efficient experience with {{es-serverless}}.

kilfoyle · 2025-08-05T21:12:05Z

deploy-manage/cloud-organization/billing/elasticsearch-billing-dimensions.md

+    * To ensure optimal performance and cost-effectiveness for your project, it’s important to consider how you structure your data.
+        * Consolidate small indices for better efficiency. We recommend avoiding a design where your project contains hundreds of very small indices, specifically those under 1GB each.
+    * Why is this important?
+         * Every index in Elasticsearch has a certain amount of resource overhead. This is because Elasticsearch needs to maintain metadata for each index to keep it running smoothly. When you have a very large number of small indices, the combined               overhead from all of them can consume more CPU resources than if the same data were stored in fewer, larger indices. This can lead to higher resource consumption and hence higher costs and potentially impact the overall performance of your project.


Suggested change

* Every index in Elasticsearch has a certain amount of resource overhead. This is because Elasticsearch needs to maintain metadata for each index to keep it running smoothly. When you have a very large number of small indices, the combined overhead from all of them can consume more CPU resources than if the same data were stored in fewer, larger indices. This can lead to higher resource consumption and hence higher costs and potentially impact the overall performance of your project.

* Every index in {{es}} has a certain amount of resource overhead. This is because {{es}} needs to maintain metadata for each index to keep it running smoothly. When you have a very large number of small indices, the combined overhead from all of them can consume more CPU resources than if the same data were stored in fewer, larger indices. This can lead to higher resource consumption and hence higher costs and potentially impact the overall performance of your project.

kilfoyle

Thanks @shubhaat. I've approved.

Please add in my two suggestions though, just to remove extra spacing and use our variables for the product names.

Update indexing strategy recommendations

e646993

Indexing strategy recommendations

shubhaat requested a review from a team as a code owner August 5, 2025 00:02

kilfoyle reviewed Aug 5, 2025

kilfoyle reviewed Aug 5, 2025

Update deploy-manage/cloud-organization/billing/elasticsearch-billing…

f089367

…-dimensions.md Co-authored-by: David Kilfoyle <[email protected]>

shubhaat requested a review from kilfoyle August 5, 2025 20:51

kilfoyle reviewed Aug 5, 2025

kilfoyle approved these changes Aug 5, 2025

Merge branch 'main' into indexing-strategy-recommendations

abc5880

shubhaat merged commit a6d19bf into main Aug 5, 2025
8 checks passed

shubhaat deleted the indexing-strategy-recommendations branch August 5, 2025 21:47
